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Abstract — The error pattern correcting code (EPCC) can be 
constructed to provide a syndrome decoding table targeting the 
dominant error events of an inter-symbol interference channel at 
the output of the Viterbi detector. For the size of the syndrome 
table to be manageable and the list of possible error events to 
be reasonable in size, the codeword length of EPCC needs to 
be short enough. However, the rate of such a short length code 
will be too low for hard drive applications. To accommodate the 
required large redundancy, it is possible to record only a highly 
compressed function of the parity bits of EPCC's tensor product 
with a symbol correcting code. In this paper, we show that the 
proposed tensor error-pattern correcting code (T-EPCC) is linear 
time encodable and also devise a low-complexity soft iterative 
decoding algorithm for EPCC's tensor product with g-ary LDPC 
(T-EPCC-gLDPC). Simulation results show that T-EPCC-gLDPC 
achieves almost similar performance to single-level gLDPC with 
a 1/2 KB sector at 50% reduction in decoding complexity. 
Moreover, 1 KB T-EPCC-gLDPC surpasses the performance of 
1/2 KB single-level gLDPC at the same decoder complexity. 

Index Terms — Tensor product codes, inter-symbol interfer- 
ence, turbo equalization, error-pattern correction, g-ary LDPC, 
multi-level log likelihood ratio, tensor symbol signatures, 
signature-correcting code, detection postprocessing. 

I. Introduction 

THe advent of high recording density enabling technolo- 
gies, pioneered by galloping improvements in head and 
media design and manufacturing processes, has pushed for 
similar advances in read channel design and error correc- 
tion coding, driving research efforts into developing channel- 
capacity-approaching coding schemes based on soft iterative 
decoding that are also implementation friendly [1], [2]. Soft 
decodable error correction codes (ECC), mainly low den- 
sity parity check (LDPC) codes, would eventually replace 
conventional Reed-Solomon (RS) outer ECC, which despite 
its large minimum distance, possesses a dense parity check 
matrix that does not lend itself easily to powerful belief 
propagation (BP) decoding. There exists vast literature on the 
various design aspects of LDPC coded systems for magnetic 
recording applications. This includes code construction [3]- 
[6], efficient encoding [7], [8], decoder optimization [9]- 
[11], and performance evaluation [12]-[14]. In this work, we 
propose an LDPC coded system optimized for the magnetic 
recording channel that spans contributions in most of these 
areas. 
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The error-pattern correcting code (EPCC) is proposed 
in [15]-[17] motivated by the well-known observation that the 
error rate at the channel detector output of an ISI channel is 
dominated by a few specific known error cluster patterns. This 
is due to the fact that the channel output energies associated 
with these error patterns are smaller than those of other 
patterns. A multiparity cyclic EPCC was first described in 
[16], with an RS outer ECC, possessing distinct syndrome 
sets for all such dominant error patterns. To reduce the 
code rate penalty, which is a severe SNR degradation in 
recording applications, a method to increase the code rate was 
introduced in [17] that also improved EPCC's algebraic single 
and multiple error-pattern correction capability. In this method, 
the generator polynomial of the short base EPCC is multiplied 
by a primitive polynomial that is not already a factor of the 
generator polynomial. Also, the primitive polynomial degree 
is chosen so as to achieve a certain desired codeword length. 
Moreover, [17] describes a Viterbi detection postprocessor that 
provides error-event-reliability information aiding syndrome- 
mapping of EPCC to improve its correction accuracy. How- 
ever, improving the EPCC code rate by extending its codeword 
length increases the probability of multiple dominant error 
patterns within the codeword, and this requires increasing 
the size of the syndrome table considerably to maintain the 
same correction power, which eventually results in prohibitive 
decoding complexity. To maintain correction power with a 
manageable size syndrome decoding table, [18] discusses a 
more efficient method based on a list decoding strategy that 
delivers satisfactory sector error rate (SER) gain with an outer 
RS ECC. Later, this list decoding scheme was formulated as 
a soft-input soft-output block in [19] and utilized to enhance 
the performance of turbo equalization based on convolutional 
codes (CC). Nevertheless, the serial concatenation scheme that 
proved successful with RS hard decoding and CC-based turbo 
equalization does not work as well in serial concatenation of 
long-EPCC and LDPC. The reason is that when the LDPC 
decoder fails, especially in the water-fall region, the sector 
contains a large number of multiple error occurrences. When 
many such error events occur in a given EPCC codeword, 
decoding by any reasonable size list decoder is formidable. 
Thus, an inner EPCC cannot in any capacity reduce the SER 
of a serially concatenated outer LDPC. On the other hand, if 
the EPCC codeword length is decreased substantially, then the 
number of errors per codeword is reasonable, as long as the 
overall code rate is somehow kept high. Here, the concept of 
tensor product construction comes into play. 

Tensor product parity codes (TPPC) were first proposed in 
[21] as the null-space of the parity check matrix resulting 
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from the tensor product of two other parity check matrices 
corresponding to a variety of code families. As a result, based 
on the choice of the concatenated codes, TPPC would be 
classified as an error correction code if constructed from two 
ECCs, an error detection code (EDC) if constructed from two 
EDCs, and an error location code (ELC) if constructed from an 
EDC and an ECC in a special order As a matter of fact, ELCs 
were introduced earUer in [24] and their algebraic properties 
studied in detail, but later incorporated in the unified theme 
of TPPCs in [21]. Furthermore, a generalized class of hard- 
decodable ELCs was later suggested for application in the 
magnetic recording channel in [25]. In addition, TPPCs can 
be generalized by combining a number of codes on various ex- 
tension fields with shorter binary codes. For this more general 
case, a decoding algorithm was developed in [26]. An ECC- 
type TPPC was applied to longitudinal magnetic recording 
in [22], and to perpendicular magnetic recording in [23]. 
In [22], a hard decodable tensor product code based on a single 
parity code and a BCH code is proposed as an inner code for 
RS. This code is suitable for low density longitudinal recording 
channels for which dominant errors have odd weights, such as 
{+} and {+, — , +}. Also, |22| proposes that the hard decoder 
passes the corrected parity bits to a Viterbi detector reflecting 
channel states and parity code states in order to compute 
the decoder output. Later, [23] presented two methods for 
combining a tensor-product single parity code with a distance- 
enhancing constrained code. This code combination achieved 
more satisfactory performance with RS as an outer code in 
high density perpendicular recording channels. 

Our goal in this work is to utilize the concept of tensor 
product concatenation to construct high rate soft-decodable 
EPCCs on die symbol-level of die outer ECC. The EPCC tar- 
get error list is matched to the dominant error events normally 
observed in high density perpendicular recording channels. 
Since dominant error events in perpendicular recording are 
not only of odd weight [12], this requires that our EPCC be 
a multiparity code. However, in this case, a Viterbi detector 
matching the channel and parity will have prohibitive complex- 
ity. In spite of this, the performance of the optimal decoder of 
the baseline parity-coded channel can be approached by the 
low complexity detection postprocessing technique in [18]. 
We also present in detail a low complexity highly parallel 
soft decoder for T-EPCC and show that it achieves a better 
performance-complexity tradeoff compared to conventional 
iterative decoding schemes. 

A. Notations and Definitions 

• For a linear code C : {n,k,p), n denotes the codeword 
length, k denotes the user data length, and p = n — k 
denotes the number of code parity bits. 

• For a certain parity check matrix H corresponding to a 
linear code {C : He* = 0, Vc G C}, a syndrome s is the 
range of a perturbation of a codeword H{c + e)* = s. A 
signature refers to the range under H for any bit block, 
not necessarily a codeword formed of data and parity bits. 

• The multilevel log-likelihood ratio (mlLLR) of a random 
variable /3 G GF{q) corresponding to the p.m.f. (proba- 
biUty mass function) = Pr(^ = i),J2iZoPiil^) = 



I, can be defined as: = i) = log(|^), 7(/3 = 0) = 
0. 

• [x]^ denotes a local segment [xi,Xij.i, ...,Xj] of the 
sequence Xk- 

• The period of a generator polynomial on GF(2) corre- 
sponding to a linear code is equal to the order of that 
polynomial, as defined in [36]. Also, for a syndrome set 
{s'^^^jfSQ that corresponds to all the L possible starting 
positions of an error event, the period P is defined as the 
smallest integer such that s'^p+^^ = s'^p^ [18]. 

• Assume = log(a) and (3l = \og{(3), then (a + 

/3)l = log(e"^ + e^^). Define max* ^ = 

(a + I3)l = max(aL,/3L) + log(l + e-\°'^-f^^\). Also, 

max* ( {7fc}t=o ) and max* (7^) are two different rep- 
resentations of the recursive implementation of max* 
acting on the elements of the set {7fe}|^^. 

B. Acronyms 

• TPPC: Tensor Product Parity Code. 

• g'LDPC: Q'-ary Low Density Parity Check code. 

• RS: Reed Solomon code. 

. BCJR: Bahl-Cocke-Jelinek-Raviv . 

• T-EPCC: Tensor product Error Pattern Correction Code. 
. T-EPCC-gLDPC and T-EPCC-RS: Tensor product of 

EPCC and gLDPC or RS, respectively. 

• LLR: Log-LikeUhood Ratio. 

• mlLLR: multi-level Log-Likelihood Ratio. 

• ML: Maximum Likelihood. 

• MAP: Maximum A Posteriori. 

• QC: Quasi-Cyclic. 

• SPA: Sum-Product Algorithm. 

II. Review of EPCC and the Tensor Product 

Coding Paradigm 

In this section we give a brief review on the concept of 
EPCC, including the design of two example codes that will 
be utilized later in the simulation study. Also, we review 
the tensor product coding paradigm and present an encoding 
method that allows for EPCC-based Unear-time-encodable 
TPPCs. 

A. EPCC Review and Examples 

We review constructing a cyclic code targeting the set of 
Imax dominant error events 

{e'~^\xUk\x),...,el--\x)} 

represented as polynomials on GF{2) that can occur at any 
starting position A: in a codeword of length It- A syndrome 
of error type e^'^\x) at position k is defined as s^^\x) = 
e^^\x) mod g{x), where g{x) is the generator polynomial of 
the code and mod is the polynomial modulus operation. 

A syndrome set S*^*-' for error type e(*^(x) contains elements 
corresponding to aU cyclic shifts of polynomial e^^\x); ele- 
ments of S^'^ are thus related by s^^^j = x^ s^^^ mod ^(a;). 
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Fig. 1. TPPC Ci(ni,fci,pi) ^ C2{n2, k2,P2) codeword structure 



For unambiguous decoding of e'*^(a;) and e*^^-'(x), V{j,j}, 
we must have S^*^ fl S*--'^ — 0. This design requirement 
constrains g{x) to have distinct greatest common divisors with 
6*^*^(0;), for all targeted i [16]. However, even if this constraint 
is satisfied, an element in S^*' can still map to more than 
one error position, i.e., the period of the syndrome set- and 
period of g{x)- can be less than It- Moreover, this constraint 
is only sufficient but not necessary. As shown in [16], there 
may exist a lower degree g{x) that can yield distinct syndrome 
sets for the targeted error polynomials, resulting in a higher 
rate EPCC. A search method to find this g{x) is already 
discussed in detail in [16] and [18]. We next give two example 
EPCC constructions that will be used throughout the paper. 
We target the dominant error events of the "ideal" equalized 
monic channel 1 + 0.85-D in AWGN, which is suitable as 
a partial response target in perpendicular magnetic recording 
read channels. For this channel, the dominant errors are given 
by: e(i)(x) = 1, e(^)(x) = 1 + x, e^^'>{x) = I + x + x'^, 
etc., i.e. they can be represented as polynomials on GF{2) 
for which all powers of x have nonzero coefficients. The two 
EPCCs ai-e: 

• Example 1: Targeting error polynomials up to degree 
4, we get the generator polynomial g{x) = 1 + .t + 
x^ + x^ + x^ of period 12 via the search procedure 
of [16]. Choosing a codeword length of 12, 5 distinct, 
non-overlapping syndrome sets are utilized to distinguish 
the 5 target errors. Then, syndrome set S'"^^ will have 
period 6, while all other sets have period 12. A syndrome 
set of period 6 means that each syndrome decodes to one 
of 2 possible error positions within the 12-bit codeword. 
Nonetheless, e'^'^^(x) can be decoded reliably via channel 
reliability information and the polarity of data support. 
The low code rate of 0.5 makes this code unattractive as 
an inner code in a serial concatenation setup for recording 
channel applications. However, as we will see later, a 
tensor code setup makes it practical to use such powerful 
codes for recording applications. 

• Example 2: Targeting error polynomials up to degree 9, 
we have to record more redundancy. To accomplish this 
feat, a cyclic code with 8 parity bits, code rate 0.56, and a 
generator polynomial g{x) = 1 + x'^ + x^ + x^ + x^ + x^ 
of period 18 is found by the search procedure in [16]. 
Then, syndrome sets S^^), S^^), S^^^, and S'''' each have 
period 18 and thus can be decoded without ambiguity. 
While syndrome sets S^^), S''^', S^^), S^^\ and S(i"' 
each have period 9, decoding to one of two positions. 
The worst is S'^^ of period 2, which would decode to one 
of 9 possible positions. Still, the algebraic decoder can 
quickly shrink this number to few positions by checking 
the data support, and then would choose the one position 
with highest local reliability. 



B. Tensor Product Parity Codes 

1 ) Construction and Properties of the TPPC Parity Check 
Matrix: Consider a binary linear code Ci : (rii, fci,pi) derived 
from the null space of parity check matrix Hc^ , and assume Ci 
corrects any error event that belongs to class ei. Also, consider 
a non-binary linear code C2 : ('^2 , ^2 , P2 ) derived from the null 
space of parity check matrix iJca and defined over elements 
of GF{2P^). Moreover, assume this code corrects any symbol 
error type that belongs to class £2- As a preliminary step, 
convert the binary pi x ni matrix Hc^, column by column, 
into a string of GF{2'p^) elements of dimension 1 x rii. Then, 
construct the matrix 

as a p2 X n-i"-2 array of GF{2'p^) elements. Finally, convert 
the elements of into pi-bit columns, based on the same 
primitive polynomial of degree pi used all over in the con- 
struction method. The null space of the piP2 x nin2 binary 
He, corresponds to a linear binary code C3 : (nin2, ^3,^1^2)- 
As shown in Fig. [1] a C3 codeword is composed of n2 blocks 
termed "tensor-symbols", each having ni bits. Also, it can be 
shown that C3 can correct any collection of tensor symbol 
errors belonging to class 62, provided that all errors within 
each tensor symbol belong to class £1 [21]. Note that a tensor 
symbol is not an actual Ci codeword, and as such, using 
the terms "inner" and "outer" codes would not be completely 
accurate. In addition, the tensor symbols are not codewords 
themselves, as can be seen in Fig. [T] the first k2 tensor symbols 
are all data bits to start with, and even the last p2 tensor 
symbols, which are composed of data and parity bits, have 
non-zero syndromes under Hc^ . Thus, a TPPC codeword does 
not correspond directly to either iJci or Hc2, and as a result, 
the component codebooks they describe are not recorded 
directly on the channel. Another interesting property of the 
resulting TPPC is that the symbol-mapping of the sequence 
of tensor-symbol signatures under Hc^ forms a codeword of 
C2, which we refer to as the "signature-correcting component 
code". 

2} Encoding of Tensor Product Parity Codes: The encoding 
of a TPPC can be performed using its binary parity check 
matrix, but the corresponding binary generator matrix is not 
guaranteed to possess algebraic properties that enable linear 
time encodability. Thus, an implementation-friendly approach 
would be to utilize the encoders of the constituent codes, 
which can be chosen to be linear time encodable. 

Consider a binary code Ci : (ni,fci,pi) that is the null 
space of parity check matrix H^^, and a non-binary code 
G2 ■ {n2,k2,P2) defined on GF{2'p^), the tensor-product 
concatenation is a binary C3 : (71,3,^3,^3), where: 

= n2 X ni,fc3 rii x ^2 - pi x p2,P3 = Pi x p2 
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Fig. 2. TPPC encoder of Ci (ni, fei, pi) (?) C2(n2i ^2; P2):(a) signatures calculated under Hc^ , then the pi-bit signatures are mapped to GF{2P^ ) symbols, 
(b) k2 signatures encoded by generator matrix Gc2 into a C2 codeword, then parity symbols are mapped back to GF{2), (c) pi X p2 TPPC parity bits are 
calculated by back substitution. 



Assume that Ci is a cyclic code, and C2 is any of the 
linear time encodable codes, where we choose a quasi-cyclic 
(QC) component code for the purpose of this study. Then, 
the encoders of Ci and C2 communicate via the following 
algorithm to generate a codeword of C3, see Fig. |2] 

(i) Receive a block of ni x fc2 + fci x 712 — fci x ^2 bits from 
the data source, call it major block a. 

(ii) Divide major block a into minor block (3 of ni x k2 bits, 
and minor block 7 of fci x 71,2 — fci x k2 bits (i.e. ki x p2 
bits). 

(iii) Divide block /3 into ^2 columns each of ni bits. Then, for 
each column, calculate the intermediate pi-bit signature 
under the parity check matrix of Ci. Using a feedback 
shift register (FSR) to calculate the signatures, the com- 
putational cost is oc ni operations per signature, and 
oc rii X ^2 for this entire step. 

(iv) Convert intermediate signatures from pi-bit strings into 
GF{2P^) symbols. 

(v) Encode the ^2 non-binary signatures into a C2 codeword 
of length 712. Using FSRs to encode the quasi-cyclic C2, 
the computational complexity of this step is cx ri2. 

(vi) Convert computed signatures back into pi-bit strings. 

(vii) Divide block 7 into p2 columns each of fci bits. Add pi 
blanks in each column to be filled with the parity bits 
of C3. Then, align each column with the p2 signatures 
computed in the previous step, leaving pi blanks in each 
column. 

(viii) Fill blanks in the previous step such that the signature of 
data plus parity blanks under Ci equals the corresponding 
aligned signature from step (vi). The parity can be calcu- 
lated using the systematic and the method of back 
substitution which requires a computational complexity 
oc ni per column. 

The total computational complexity of this encoding algo- 
rithm is oc ni X /c2 + + X P2, i-e. it is oc ni x 71,2 = n^, 
which is the TPPC codeword length. Thus, we have shown- 
with some constraints- that if Ci and C2 are linear time 
encodable, then C3 = Ci ® C2 is linear time encodable. 



III. T-EPCC-RS Codes 

To demonstrate the algebraic properties of TPPC codes, we 
present an example code suitable for recording applications 
with 1/2 KB sector size. Consider two component codes: 

« A binary cyclic (18, 10) EPCC of example 2 above with 
rate 0.556, 8 parity bits, and parity check matrix in 
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. A (255, 195) RS over GF{2^), of rate 0.765, t = 30, and 
60 parity symbols. 
The resulting TPPC is a binary (4590,4110) code, of rate 
0.896, and redundancy of 480 parity bits. For this code, a 
codeword is made of 255 18-bit tensor symbols, of which, any 
combination of 30 or less tensor symbol errors are correctable, 
provided that each 18-bit tensor symbol has a single or 
multiple occurrence of a dominant error that is correctable by 
EPCC, those being combinations of error polynomials up to 
degree 9. Furthermore, although the EPCC constituent code 
has a very low rate of 0.556, the resulting T-EPCC has a 
high rate of 0.895. Notably, in the view of the 18-bit EPCC, 
this 61% reduction in recorded redundancy corresponds to an 
SNR improvement of 2 dB in a channel with rate penalty 
~ 101og]^Q(l/i?), and 4.1 dB in a channel with rate penalty 

^101ogi„(l/i?2). 

A. Hard Decoding of T-EPCC-RS Codes 

Hard decoding of T-EPCC-RS directly reflects the code's 
algebraic properties, and thus, serves to further clarify the 
concept of tensor product codes. Hence, we discuss the hard 
decoding approach before going into the design of soft decod- 
ing of T-EPCC codes. The decoding algorithm is summarized 
by the following procedure, see Fig. |3] 
(i) After hard slicing the output of the Viterbi channel 
detector, the signature of each tensor symbol is calculated 
under Hepcc- Each signature is then mapped into a Galois 
field symbol, where the sequence of non-binary signatures 
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Fig. 3. Hard decoder of (18, 10, 8) EPCC (g) (255, 195, 2t) RS of t = 30 over GF(2**). 



constitute an RS codeword - that is if the channel detector 
did not suffer any errors. 

(ii) Any hard-input RS decoder, such as the Berlekamp- 
Massey decoder, acts to find a legitimate RS codeword 
based on the observed signature-sequence. 

(iii) If the number of signature-symbols in error is larger than 
the RS correction power, RS decoding fails and the tensor 
product decoder halts. 

(iv) Otherwise, if RS decoding is deemed successful, the 
corrected signature-symbol sequence is added to the 
original observed signature-symbol sequence to generate 
the "error syndrome-symbol" sequence. 

(v) Each error syndrome-symbol is mapped into an EPCC 
bit-syndrome of the corresponding tensor symbol. 

(vi) Finally, EPCC decodes each tensor symbol to satisfy the 
error-syndrome generated by the component RS, in which 
it faces two scenarios: 

• A zero "error-syndrome" at the output of RS decoding 
indicates either no error occurred or a multiple error 
occurrence that has a zero EPCC-syndrome, which 
goes undetected. In this case, the EPCC decoder is 
turned off to save power. 

• A non zero "error-syndrome" will turn EPCC correc- 
tion on. If the error-syndrome indicates a single error 
occurrence in the target set, then, the EPCC single error 
algebraic decoder is turned on. On the other hand, if 
the error-syndrome is not recognized, then EPCC list 
decoding is turned on with a reasonable-size list of test 
words. 

Note that although the number of EPCC codewords 
(tensor symbols) is huge, the decoder complexity is 
reasonable since EPCC decoding is turned on only for 
nonzero error-syndromes. 

IV. T-EPCC-gLDPC CODES 

We learned from the design of T-EPCC-RS that the compo- 
nent signature-correcting codeword length can be substantially 



shorter than the competing single level code. Although the 
minimum distance is bound to be hurt if the increased redun- 
dancy does not compensate for the shorter codeword length, 
employing iterative soft decoding of the component signature- 
correcting code can recover performance if designed properly. 
While LDPC codes have strictly lower minimum distances 
compared to comparable rate and code length RS codes, 
the sparsity of its parity check matrix allows for effective 
belief propagation (BP) decoding. BP decoding of LDPC 
codes consistently performs better than the best known soft 
decoding algorithm for RS codes. Since the TPPC expansion 
enables the use of 2 to 4 times shorter component LDPC 
compared to a competing single level LDPC, a class of LDPC 
codes efficient at such short lengths are critical. LDPC codes 
on high order fields represent such good candidates. In that 
respect, [29] showed that the performance of binary LDPC 
codes in AWGN can be significantly enhanced by a move 
to fields of higher orders (extensions of GF{2) being an 
example). Moreover, [29] established that for a monotonic 
improvement in waterfall performance with field order, the 
parity check matrix for very short blocks has to be very sparse. 
Specifically, column weight 3 codes over GF{q) exhibit worse 
bit-error-rate (BER) as q increases, whereas column weight 
2 codes over GF{q) exhibit monotonically lower BER as q 
increases. These results were later confirmed in [30], where 
they also showed through a density evolution study of large q 
codes that optimum degree sequences favor a regular graph 
of degree 2 in all symbol nodes. On the other hand, for 
satisfactory error floor performance, we found that using a 
column weight higher than 2 was necessary. This becomes 
more important as the minimum distance decreases for lower 
q. For instance, we found that a column weight of 3 improved 
the error floor behavior of G'F(2^)-LDPC at the expense of 
performance degradation in the waterfall region. 
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A. Design and Construction of qLDPC 

The low rate and relatively low column weight design of 
gLDPC in a TPPC results in a very sparse parity check 
matrix, allowing the usage of high girth component gLDPC 
codes. To optimize the girth for a given rate, we employ 
the progressive edge growth (PEG) algorithm [30] in gLDPC 
code design. PEG optimizes the placement of a new edge 
connecting a particular symbol node to a check node on 
the Tanner graph, such that the largest possible local girth 
is achieved. Furthermore, PEG construction is very flexible, 
allowing arbitrary code rates, Galois field sizes, and column 
weights. In addition, modified PEG-construction with linear- 
time encoding can be achieved without noticeable performance 
degradation, facilitating the design of linear time encodable 
tensor product codes. Of the two approaches to achieve linear 
time encodability, namely, the upper triangular parity check 
matrix construction [30] and PEG construction with a QC 
constraint [31], we choose the latter approach, for which 
the designed codes have better error floor behavior. T-EPCC- 
gLDPC lends itself to iterative soft decoding quite naturally. 
Next, we present a low complexity soft decoder utilizing this 
important feature. 

B. Soft Decoding of T-EPCC-qLDPC 

To fully utilize the power of the component codes in T- 
EPCC-gLDPC, we need to develop a soft iterative version of 
the hard decoder of T-EPCC-RS. To limit the complexity of the 
proposed soft decoder, sub-optimal detection post-processing 
is adopted instead of the maximum a posteriori (MAP) de- 
tector to evaluate tensor symbol signature reliabilities. The 
complexity of the optimal MAP detector matched to both 
the channel of memory length L and Hepcc of row length 
p is exponential in p + L — 1. We present a practical soft 
detection scheme that separates soft channel detection from 
tensor symbol signature detection, though, through a compo- 
nent signature-correcting LDPC in a TPPC setup, approaches 
the joint MAP performance through channel iterations. The 
main stages of the decoder are, see Fig. ID 

(1) Detection postprocessing: 

> Utilizing a priori information from the previous de- 
coding iteration, binary Viterbi generates the hard ML 
word based on channel observations, for which the 
error sequence is calculated and passed to the correlator 
bank. 

• A bank of local correlators estimates the probability 
of dominant error type/location pairs for all positions 
inside each tensor symbol. 

(2) Signature p.m.f. calculation: 

• For each tensor symbol, the list of most likely error 
patterns is constructed. This list includes single occur- 
rences and a predetermined set of their combinations. 
The list is then divided into sublists, each under the 
signature value it satisfies. 

• For each tensor symbol, using each signature value's 
error likeUhood list, we find the signature p.m.f. of that 
symbol. 



(3) g-ary LDPC decoding: 

• Using the observed sequence of signature p.m.f.'s, we 
decode the component g-ary LDPC via FFT-based 
SPA. 

• For each tensor symbol, the LDPC-corrected signature 
p.m.f. is convolved with the observed signature p.m.f. 
at its input to generate the error-syndrome p.m.f.. 

(4) EPCC decoding: 

« For each tensor symbol, we find the list of most 
probable error-syndromes and generate a list of test 
error words to satisfy each syndrome in the list. 

• A bank of parallel EPCC single-error correcting de- 
coders generates a list of most probable codewords 
along with their reUabiUties. 

(5) Bit-LLR feedback: 

• Using the codeword reliabilities we generate bit-level 
reliabilities that are fed back to the Viterbi detector 
and the detection postprocessing stage. Those bit-level 
reliabilities, serving as a priori information, favor paths 
which satisfy both the ISI and parity constraints. 

We explain each of these steps in the following sections, but 
we replace any occurrence in the text of syndrome (signature) 
p.m.f. by syndrome (signature) multi-level log-likelihood ra- 
tios (mlLLR), as decoding will be entirely in the log domain 
for reasons explained below. 



Hard-decision 
binary Viterbi 



C '■ ML word 



C =£ + [€<" ]f 



C(E 

H* h 

C(E' 



u S 

^ I, 



§1 



j + l-l 



J') 



i = (f + l)x/j -1 



Fig. 5. Bank of parallel error-matched correlators to find error pattern 
type/position reliabilities. 

1) Detection Postprocessing: At this decoder stage we 
prepare a reliability matrix C (E) for error type/position pairs 
- captured in a tensor symbol of length It- that is usable by the 
next stage to calculate the tensor symbol's signature mlLLR: 



CiE) = 

1 
2 

Im.n.T. 







It 

=(1) 



where C(e[.*'') is the error pattern (type i/ position k) reliability 
measure computed by the maximum a posteriori (MAP)- 
based error-pattern correlator shown in Fig. |5] The bank of 
local correlators discussed here was also employed in [18] 
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Fig. 4. T-EPCC-gLDPC soft decoder of (12,6,6) EPCC (g) (380, 323) GF(2'')-LDPC. 
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for AWGN channels, and in [19] for data-dependent noise 
environments. We now discuss how to generate these local 
metrics. Let be the channel detector input sequence r^, = 
Cfc * hk + Wk, where Ck is the bipolar representation of 
the recorded codeword sequence, hk is the partial response 
channel of length Ih, and Wk is zero-mean AWGN noise with 
variance cr^. Also, let qk ^ rk~{ck*hk) = {ck-Ck)*hk+Wk 
be the channel detector's output error sequence. If a target 
error pattern sequence ej. occurs at positions from k = j to 
k = j + li — 1, then qk can be written as 



maximize becomes 



Wk 



[S«]f' ■ + Wk 



Wk 



(2) 



where s 



(i) . 



is the channel response of the error sequence, and 
is given by Sj, — ej^ * hk, and /f — li + lh~2. Note that we 
define the start of the tensor symbol at j = 0. So, if j < 0, 
then the error pattern starting position is in a preceding tensor 
symbol. 

The reliability for each error pattern with starting position, 
j, can be computed by the local a posteriori probabilities 
(ignoring tensor symbol boundaries for now): 



Pr([e«]f' ' 



= Pr [i 



,(■01^'+'" 



(3) 



The most likely assumed error type/position pair in a tensor 
symbol maximizes the a posteriori probability ratio of its 
reliability to the reliability of the most probable error event 
(the competing event in this case would be the ML word itself, 
with no error occurrence assumed at the output of Viterbi 
detection). Hence, utilizing (|3]l and Bayes rule, the ratio to 



Pr(ef 






Pr 


'[ML word]j+''"^ 


c, 





Pr 






Pr([aC>]f' f) 


Pr 






P,-([i«l]j+'!) 



(4) 



where [s'*)]^^'' is the ML word's noiseless channel response. 

Given the noise model, [qj^ * is a sequence of independent 
Gaussian random variables with variance cr^. Therefore, max- 
imizing (|4]l can be shown to be equivalent to maximizing the 
log-likelihood local measure [18]: 



k=j 

where the a priori bias in (|5]l is evaluated as 

Pr([sW]f' 



Pr([i^'^]r'-) 

Pr([sW]f'0 
(5) 



log- 



Pr([sW]^. 



E 

k=j,Ck=+l 



E 

k=j,Ck=- 



(6) 



where Xk is the a priori LLR of the error-event bit at position 
k as received from the outer soft decoder, and we are assuming 
here that error event sequences do not include bits, i.e., the 
ML sequence and error sequence do not agree for the entire 
duration of the error event. Equation (|5]l represents the "local" 
error-pattern correlator output in the sense that it essentially 
describes the correlator operation between qk and the channel 

(i) 

output version of the dominant error pattern ' within the 
local region + Z^]. However, equation (|5]l ignores that 
errors can span tensor symbol boundaries when j < or j + 
k ~ I > /t — 1- For instance, an error in the first bit of 
the tensor symbol can result from a single error event in that 
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bit, a double error event in the last bit of the preceding tensor 
symbol, a triple error event occurring two bits into the previous 
symbol, and so on. Hence, the probability of an error in the 
first bit is the sum of all these parent error event probabilities. 
Moreover, this can be easily generahzed to boundary errors 
extending beyond the first bit. In a similar manner, an error in 
the last bit of a tensor symbol can result from a single error 
event in that bit, a double error event starting in that bit and 
continuing into the next tensor symbol, a triple error starting 
at the last bit and continuing into the next tensor symbol, and 
so on. Again, the probability of an error event in that bit is the 
sum of the probabiUties of all these parent events. Moreover, 
we have to nullify the probability of the parent error events in 
the modified rehabihty matrix since they are already accounted 
for in the last bit's reliability calculation. Furthermore, this can 
also be generalized to error events starting earlier than the last 
bit and extending into the next tensor symbol. In summary, 
to calculate a modified metric relevant to the current tensor 
symbol, we utilize the following procedure: 
. at j = 0, modify C(e[,'^) 



C(e«)= W (c(eL'i+,)), 

max is the maximum 



independently for each i, where I 
length of a targeted error pattern. 
Starting at i = 1 and j ~ It — 1, do: 



(i) C(ef^) = ma.T* {C{e)^') 



i„ 



k—i 



(ii) V/c > i, set C{ef'>) 



-00. 



(iii) Set i = i + l, j=j — 1. 

(iv) If i < l„iax go back to (i). 

We assume here that dominant error events span only two 
tensor symbols at a time and that they do not include error free 
gaps, which is certainly true for the case study of this paper. 
Following this procedure we obtain the modified reliability 
matrix C (E). 

2) Signature mlLLR Calculation: For each tensor symbol 
i, utilizing C (E), we need to find the p.m.f. or the log 
domain mlLLR of its signature Sg., e GF{2P""=''), for EPCC 
with Pepcc parity bits. To limit the computational complexity 
of this calculation, we construct a signature only from the 
dominant errors and a subset of their multiple occurrences. 
Denote Pr{Sgi = a^^^) as the running estimate of the p.m.f. 
at q'-\ and ^{Sgi = a^'^) = log{Pr{Sgi = a^'^)) as the 
running estimate of mlLLR. Denote a one dimensional index 
of C (E) as p'"'^ = (pc X l„iax) + Pr Corresponding to the 
Pr-th row and Pc-th column of C {E) and error E{p^'^). We 
choose the dominant list as the L patterns with the largest 
corresponding elements of C (E) having indexes {Pi'^}lZi- 
Based on this list, we developed the following procedure to 
compute ^{Sgi = a^~^): 

• Step 1 (Single occurrences): 



7(% = a^-i) = 
V/c : Gf {Hepcc 

Qepcc 



PL 

max 

k=pl' 



X c. 



(c'(fc); 



(7) 



where qepcc = 2^""^", and Gf{.) is an operator that maps 
Pepcc-bit vectors into GF(gepcc) symbols. 
Step 2 (Double occurrences): 

j{Sgi = a^'^) = 



7 {Sgi = a^-i) 



L ^Pl 



V{fc,m} : D iEik),E{m:)) > Ejree, 
G/ {Hcpcc X © m © E{m)f) = a=-' 

Qepcc 



(8) 



where D is the error free distance between the two errors, 
Efree = — 1 is the error free distance of the channel 
beyond which the errors are independent. 

Step M (M occurrences): 
%Sgi = a^-i) = 



max 



7 {Sg^ = a- 
pT 



.M 



V{gi,g2, ••■,gM} : D {E{qs),E{qt)) > Efree, 



Qepc 



5=M 

«=1 



Gf (Hepcc X [0 Eiq,) ® c^'^'^-'Y) = a^-' 



(9) 



Step M + 1 (ML-signature reliability; computed so that 
the resulting signature p.m.f. sums to 1): 

^{Sgi = al^^^) = 



-max 





max* 7 [Sgi — a^^^) 



7 (Sgi 



^lSgi = al^^^) 



(11) 



j = -00, 1,..., Qepcc - 1; j 7^ PmL + 1- 



(12) 

• Step M + 2 (Normalization): 

^{Sgi = a^'-i) = ^Sgi = a^-') - ^{Sg, = a—). 

(13) 

In steps 1 through M, to calculate the log-likelihood of 
signature i assuming value a^~^, we sum the probabilities of 
all presumed single and multiple errors in the ML word whose 
signatures equal a^~^. This is equivalent to performing the 
max* operation in the log domain on error reUabihties dictated 
by C (E). However, to limit the complexity of this stage, we 
only use a truncated set of possible error combinations, in 
aU steps from 1 to M. Also, for signature values that do not 
correspond to any of the combinations, we set their reliability 
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to — cxD, or more precisely, a reasonably large negative value in 
practical decoder implementation. Since there are many such 
signature values, the corresponding constructed p.m.f. will be 
sparse. 

In step M + 1, the likelihood of the ML signature value is 
computed so that the p.m.f. of the tensor symbol signature 
sums to 1. In this step, the max* operation in ( fTTT l is a 
reflection of the fact that in previous steps, 1 through M, 
some multiple error occurrences have the same signature as 
the ML tensor symbol value. So, we have to account for such 
error instances in the running estimate of the ML signature 
reliability. These events correspond to cases where error events 
are not detectable by Hepcc, i e., they belong to the null space 
of Hepcc- In step M + 2, the mlLLR of the tensor symbol 
is centered around j{Sgi = 0) to prevent the qLDPC SPA 
messages from saturating after a few BP iterations. 

3) q-ary LDPC Decoding: Now, the sequence of signature 
mlLLRs is passed as multi-level channel observations to the 
qLDPC decoder We choose to implement the log-domain 
q-ary fast Fourier transform-based SPA (FFT-SPA) decoder 
in [35] for this purpose. The choice of log-domain decoding 
is essential, since if we use the signature p.m.f. as input, the 
SPA would run into numerical instability resulting from the 
sparse p.m.f. generated by the preceding stage. 

The LDPC output posteriori mlLLRs correspond to the 
signatures of tensor symbols, rather than the syndromes of 
errors expected by EPCC decoding. Similar to the decoder of 
T-EPCC-RS, error- syndrome Syn^ is the finite field sum of 
the LDPC's input channel observation of signature i, Sgf^, 
and output posteriori signature reliability, Sg^. Moreover, the 
addition of hard signatures corresponds to the convolution 
of their p.m.f.'s, and this convolution in probability domain 
corresponds to the following operation in log-domain: 



^ch — "~00, 0, Qepcc ~ 2, 
/5p — OO, 0, Qepcc 2. 



(14) 



The error-syndrome mlLLR is later normalized, similar to 
LDPC BP mILLR message normalization, according to: 



l{Syn\ 



) - liSynt 



V^e = -OO, 0, qepcc " 2. (15) 



4} EPCC Decoding: An error-syndrome will decode to 
many possible error events due to the low minimum distance of 
single-error correcting EPCC. However, EPCC relies on local 
channel side information to implement a list-decoding-like 
procedure that enhances its multiple error correction capability. 
Moreover, the short codeword length of EPCC reduces the 
probability of such multiple error occurrences considerably. To 
minimize power consumption, EPCC is turned on for a tensor 
symbol i only if the most likely value of the error-syndrome 



mlLLR is nonzero, i.e., argmax ^{Synf = a^) ^ a~°°, 

indicating that a resolvable error has occurred. After this, a few 
syndrome values, 3 in our case, most likely according to the 
mILLR, are decoded in parallel. For each of these syndromes, 
the list decoding algorithm goes as [18], [19]: 

• A test error word list is generated by inserting the most 
probable combination of local error patterns into the ML 
tensor symbol. 

« An array of parallel EPCC single-pattern correcting de- 
coders decodes the test words to produce a list of valid 
codewords that satisfy the current error-syndrome. 

< The probability of a candidate codeword is computed as 
the sum of likelihoods of its parent test-word and the 
error pattern separating the two. 

• Each candidate codeword probability is biased by the 
likelihood of the error-syndrome it is supposed to satisfy. 

In addition, when generating test words, we only combine 
independent error patterns that are separated by the error free 
distance of the ISI channel. 

5) Soft Bit-level Feedback LLR Calculation: The list of 
candidate codewords and probabilities are used to generate 
bit level-probabilities in a similar manner to [19], [27]. The 
conversion of word-level reliability into bit-level reliability for 
a given bit position can be done by grouping the candidate 
codewords into two groups, according to the binary value of 
the hard decision bit in that bit position, and then performing 
group-wise summing of the word-level probabilities. Three 
scenarios are possible for this calculation: 
(i) The candidate codewords do not all agree on the bit 
decision for location fc; then, given the list of codewords 
and their accompanying a posteriori probabilities, the 
reliability of the coded bit is evaluated as 



Ai. = los 



Scgs+ ^^(c|c,r) 



(16) 



where is the set of candidate codewords where Cfe = 
-fl, and is the set of candidate codewords where Ck = 

-1 

(ii) Although rare for such short codeword lengths, in the 
event that all codewords do agree on the decision for Ck, 
a method inspired by [27] is adopted for generating soft 
information as follows 



A/c = Z?'*'^'' X Xrnax X dk 



(17) 



where dk is the bipolar representation of the agreed- 
upon decision, Xmax is a preset value for the maximum 
reliability at convergence of turbo performance, and the 
multiplier /3'*<='' < 1 is a scaling factor <C 1 in 

the first global iterations and is increased to 1 as more 
global iterations are performed and the confidence in bit 
decisions improved. Thus, this back-off control process 
reduces the risk of error propagation, 
(iii) The heuristic scaling in ( [TtI i is again useful when EPCC 
is turned off for a tensor symbol, in case the most likely 
error-syndrome being 0. Then, the base hard value of the 
tensor symbol corresponds to the most likely error event 



10 



IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, VOL. 28, NO. 2, FEBRUARY 2010 



found as a side product in stage 2 of the T-EPCC-gLDPC 
decoder. 

C. Stopping Criterion for T-EPCC-qLDPC and RS Erasure 
Decoding 

Due to the ambiguity in mapping tensor symbols to signa- 
tures and syndromes to errors in stages 2 and 4 of the decoder, 
respectively, the possibility of non-targeted error patterns, or 
errors that have zero error-syndromes that are transparent to 
Hepcc, a second line of defense is essential to take care 
of undetected errors. Therefore, an outer RS code of small 
correction power tout is concatenated to T-EPCC-qLDPC to 
take care of the imperfections of the component EPCC. Several 
concurrent functions are offered by this code, including: 

• Stopping Flag: If the RS syndrome is zero, then, global 
iterations are halted and decisions are released. 

• Outer ECC: Attempt to correct residual errors at the 
output of EPCC after each global iteration. 

• Erasure Decoding: If the RS syndrome is nonzero, then, 
for those tensor symbols that EPCC was turned on, 
declare their bits as erasures. Next, find the corresponding 
RS symbol erasures, and attempt RS erasure decoding 
which is capable of correcting up to 2 x tout such erasures. 
In this case, T-EPCC acts as an error locating code. 

V. Simulation Results and Discussion 

We compare three coding systems based on LDPC: conven- 
tional binary LDPC, g-ary LDPC, and T-EPCC-qLDPC, where 
all the component LDPC codes are regular and constructed by 
PEG with a QC constraint. We study their sector error rate 
(SER) performance on the ideal equalized partial response 
target 1 + 0.85D corrupted by AWGN, and with coding rate 
penalty 101og]^o(l/^)- The nominal systems run at a coding 
rate of 0.9. The minimum SNR required to achieve reliable 
recording at this rate is 3.9 dB, estimated by following the 
same approach as in [28]. 

A. Single-level BLDPC & qLDPC Simulation Results 

In Fig. |6] we compare SER of the following LDPC codes, 
each constructed by PEG with a QC constraint: 

. A (4550, 4095) GF(2)-LDPC, of column weight 5, and 
circulant size 91 bits. The channel detector is a 2 state 
binai-y BCJR. 

. A (570, 510) Gi^(28)-LDPC, of codeword length 4560 
bits, column weight 2, and circulant size of 15 sym- 
bols. The channel detector is a symbol-BCJR with 256 
branches emanating from each of 2 states. 

. A (760, 684) Gi^(26)-LDPC, of codeword length 4560 
bits, column weight 2, and circulant size of 19 symbols. 
The channel detector is a symbol-BCJR with 64 branches 
emanating from each of 2 states. 

. A (775, 700) GF(26)-LDPC, of codeword length 4650 
bits, column weight 3, and circulant size of 25 symbols. 
The channel detector is a symbol-BCJR with 64 branches 
emanating from each of 2 states. 



For the binary LDPC turbo equalizer, we run a maximum of 
10 X 50 iterations, 10 global, and 50 LDPC BP iterations. 
For the q-ary turbo equalizers, on the other hand, we run 
a maximum of 3 x 50 iterations. A column weight of 2 
gives the best waterfall performance of q-ary LDPC. However, 
Gi^(2'5)-LDPC exhibits an error floor as early as at SER 
6 x 10^**, whereas a higher order field of GF{2^) does 
not show such a tendency down to 1 x 10~^. Nevertheless, 
the prohibitive complexity of GF(2*) symbol-BCJR makes 
Gi^(2®)-LDPC a more attractive choice. Still, we need to 
sacrifice GF(2®)-LDPC's waterfall performance gains to guar- 
antee a lower error floor. For that purpose, we move to a 
column weight 3 GF(2^)-LDPC that is 1.37 dB away at 
1 X 10~^ from the independent uniformly-distributed capacity 
Ci.u.D. of the channel [28], and 0.37 dB away from GF(2«)- 
LDPC a the same SER. In this simulation study, we have 
observed that while binary LDPC can gain up to 0.4 dB 
through 10 channel iterations before gain saturates, GF{2^)- 
LDPC and Gi^(2'^)-LDPC achieve very little iterative gain by 
going back to the channel, between 0.09 to 0.12 dB through 
3 channel iterations. One way to explain this phenomenon, 
is that symbol-level LDPC decoding divides the bit stream 
into LDPC symbols that capture the error events introduced 
by the channel detector, rendering the binary inter-symbol 
interference limited channel into a memoryless multi-level 
AWGN limited channel. Nonetheless, error events spanning 
symbol boundaries reintroduce correlations between LDPC 
symbols that are broken only by going back to the channel. 
In other words, if it was not due to such boundary effects, 
a q-ary LDPC equalizer would not exhibit any iterative turbo 
gain whatsoever. Nonetheless, full-blown symbol BCJR is still 
too complex to justify salvaging the small iterative gain by 
performing extra channel iterations [33]. This is where error 
event matched decoding comes into the picture, which leads 
us to the results of the next section. 

B. T-EPCC-qLDPC Simulation Results 

We first construct two T-EPCC-qLDPC codes of rate 0.9, 
the same rate as the competing single-level qLDPC. These 
TPPC's are based on EPCC (12,6) of example 1. The codes 
constructed are: 

. TPPC-A: A 1/2 KB sector, binary (4680,4212) TPPC, 
of rate 0.9, and 468 parity bits, based on a component 
(390, 312) PEG-optimized QC Gi^(26)-LDPC, of rate 
0.8, column weight 3, and circulant size 26. 
. TPPC-B: A 1 KB sector, binai-y (9360, 8424) TPPC, of 
rate 0.9, and 936 parity bits, based on a component 
(780, 624) PEG-optimized QC Gi^(26)-LDPC, of rate 
0.8, column weight 3, and circulant size 52. 
First, we study the SER of T-EPCC-qLDPC just up to the 
component GF{2^) LDPC decoder, and only at the first chan- 
nel pass. This SER is function of the Viterbi symbol error rate, 
and the accuracy of generating signature mlLLRs, in addition 
to the component LDPC employed. This SER represents the 
best that the TPPC code can do, under the assumption of 
perfect component EPCC, i.e., as long as qLDPC generates 
a clean codeword of signature-symbols, then EPCC generates 
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Fig. 6. Comparing SER of: 10 X 50 iterations of binary LDPC, 3 X 50 
iterations of GF(2*)-LDPC of column weight 2, and 3 X 50 iterations of 
GF(26)-LDPC of column weights 2 and 3. Minimum SNR to achieve reliable 
recording at coding rate 0.9 is 3.9 dB for 1 + 0.85D. 

a clean codeword of data-symbols. Fig.|7]shows the ideal SER 
of these two TPPC codes, assuming perfect EPCC, compared 
to single-level GF{2^)-LDPC and GF{2^)-LDPC. Ideal 1/2 
KB TPPC has about the same SER as single level GF{2'^)- 
LDPC at 3 X 10"^ SER. In 1/2 KB TPPC, the component 
Gi^(2'5)-LDPC has half the codeword length of the single 
level counterpart, saving 50% of the decoder complexity, while 
delivering similar SER performance. The TPPC component 
qLDPC faces a harsher channel than single-level gLDPC, 
because the symbol error probability of 6-bit data symbols is 
strictly less than the symbol error probability of 6-bit signature 
symbols, where signature symbols are compressed down from 
12-bit data symbols. Also, the shorter codeword length of 
component (/LDPC hurts its minimum distance. Still, these 
impairments are effectively compensated for by an 11.4% 
increase in the redundancy of the TPPC component LDPC. 
On the other hand, if we match the codeword length of 
TPPC's component LDPC to single-level LDPC, as part of 
constructing 1 KB TPPC, then, 1 KB TPPC will have similar- 
decoder complexity to 1/2 KB single-level LDPC with about 
0.2 dB SNR advantage for 1 KB ideal TPPC at 3 x 10"^ SER. 

Due to the imperfections of EPCC design, including mis- 
correction due to one-to-many syndrome to error position 
mapping, and undetected errors due to EPCC's small minimum 
distance, achieving the ideal performance in Fig. |7] is not 
possible in one channel pass. In addition, an outer code is 
necessary to protect against undetected errors and provide a 
stopping flag for the iterative decoder Hence, one can think of 
an implementation of the fuH T-EPCC-GF(26)LDPC decoder 
that includes an outer t = 6 (421,409) RS for the 1/2 KB 
case, and an outer t = 12 (842, 818) RS for the 1 KB case, 
so as to protect against EPCC residual errors. These outer RS 



codes are defined on GF{2^'^) and have rate 0.972. However, 
this concatenation setup will run at a lower code rate of 
0.875, which can incur an SNR degradation larger than 0.25 
dB for a noise environment characterized by the rate penalty 
101ogio(l/i?''), 5 >2.ln a more thoughtful approach, one 
can preserve the nominal code rate of 0.9 and redistribute the 
redundancy between the inner TPPC and outer RS to achieve 
an improved tradeoff between miscorrection probability and 
the inner TPPC's component LDPC code strength. In that 
spirit, we construct the following concatenated codes: 

. TPPC-C: A 1/2 KB sector, binary (4560,4218) TPPC, 
of rate 0.925, and 342 parity bits, based on a component 
(380, 323) PEG-optimized QC GF{2'^)-LDPC, of rate 
0.85, column weight 3, and circulant size 19. An outer 
t = 6 (422,410) RS code of rate 0.972 is included, 
resulting in a total system rate of 0.9. 

. TPPC-D: A 1 KB sector, binary (9120,8436) TPPC, of 
rate 0.925, and 684 parity bits, based on a component 
(760, 646) PEG-optimized QC Gi^(26)-LDPC, of rate 
0.85, column weight 3, and circulant size 38. An outer 
t = 12 (844,820) RS code of rate 0.972 is included, 
resulting in a total system rate of 0.9. 

The control mechanism of iterative decoding for these codes 
is as follows: if EPCC results in less than 6 RS symbol errors 
for the 1/2 KB design or less than 12 for the 1 KB design, 
or if EPCC generates more errors than this, but declares less 
than 12 erasures for 1/2 KB or 24 erasures for 1 KB, then, 
decoding halts and decisions are released. Otherwise, one more 
channel iteration is done by passing EPCC soft bit-level LLR's 
to Viterbi detection and the bank of error-matched correlators. 

Simulation results in Fig. |8] for a noise environment of rate 
penalty lOlogj^Q 1/i?, demonstrate that after 3 channel itera- 
tions, the ideal and practical performances of the new TPPC 
codes almost lock, while incurring minimal SNR degradation. 
Also, 1/2 KB TPPC saves 50% of decoder complexity while 
achieving the same SER performance as single level LDPC for 
an additional SNR cost of 0.04 dB at SER 1 x 10"^. Hence, 
TPPC-C represents a tradeoff between the lower complexity of 
GF(2)-LDPC and performance advantage of GF(26)-LDPC, 
whereas 1 KB TPPC has the same decoding complexity as 
single-level LDPC while furnishing 0.18 dB gain at 5 x 10"^ 
SER. In terms of channel detector implementation complexity, 
the complexity and latency of GF(2®)-BCJR in the single 
level code far exceeds the overall complexity of the non-LDPC 
parts of two level T-EPCC-GF(2^)LDPC including Viterbi 
detection. At the same time, signature mlLLR generation, 
EPCC decoding, and bit-LLR generation are all implemented 
tensor-symbol by tensor-symbol, achieving full parallelism on 
the tensor-symbol level. Furthermore, it is only when qLDPC 
finds a syndrome error that EPCC decoding is turned on for 
each tensor symbol. To eliminate redundant computations in 
the iterative decoder, branch metric computation in Viterbi 
and (|5]l is only required at the first pass. For all subsequent 
iterations, however, only the a priori bias is updated in the 
second term of (|5]l, and the branch update of Viterbi [34]. 
One very important feature of the TPPC setup, that single- 
level LDPC lacks, is its robustness to boundary error events. 
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Fig. 7. Comparing SER of: 10 X 50 iterations of binary LDPC, 3 X 50 
iterations of GF(2*)-LDPC of column weight 2, 3x50 iterations of GF(2'')- 
LDPC of column weight 3, and x 50 iterations of ideal 1/2 KB and 1 KB 
T-EPCC-GF(2'')LDPC based on column weight 3 LDPC. 

The presence of a syndrome-constraint means that errors 
spanning boundaries are broken by EPCC when attempting to 
independently satisfy the adjacent tensor symbol syndromes, 
then, in the next turbo iteration, adjacent tensor-symbols are 
decorrelated. This mechanism enables TPPC to recover from 
these errors by iterative decoding. However, for errors with a 
zero error-syndrome which go undetected by EPCC, outer RS 
protection becomes handy. 

Based on the fact that TPPC enables an increase in the 
redundancy of its component LDPC, in addition to simulation 
results demonstrating the utility of such lowered rate in 
combating the harsher compressed channel, we conjecture that 
as the sector length of both TPPC and single-level LDPC is 
driven to infinity, TPPC will achieve strict error rate SNR 
gains. This is mainly because of its surplus of redundancy 
compared to the single level code at the same rate penalty, 
whereas channel conditions and EPCC correction power do not 
change with replication of tensor symbols, and the error rate 
performance of LDPC asymptotically approaches the noise 
threshold in the limit of infinite codeword length. Therefore, 
within a channel-capacity achieving argument, in the limit of 
infinite codeword length, we take the view that TPPC will 
bridge the gap to capacity further than any single level system 
could. Moreover, the advantage of TPPC for larger sector sizes 
is more timely than ever as the industry moves to the larger 4 
KB sector format [33]. 

VI. Conclusions 

In a tensor product setup, codes of short codeword length 
and low rate can be combined into high rate codes of nice 
algebraic properties. We showed that encoding of tensor prod- 
uct codes is linear time if the component codes are linear 
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Fig. 8. Comparing SER in environment of rate penaltylO logj^Q l/R: 10 X 50 
iterations of binary LDPC, 3 X 50 iterations of GF(2'*)-LDPC of column 
weight 2, 3 X 50 iterations of GF(2'^)-LDPC of column weight 3, and 3 X 50 
iterations of practical 1/2 KB T-EPCC-GF(2'5)LDPC+RS(t = 6), and 1 KB 
T-EPCC-GF(28)LDPC+RS(t = 12), both based on column weight 3 LDPC. 

time encodable. We also demonstrated how the codeword 
length and rate of channel matched EPCC can be substantially 
increased by combining with a strong RS or LDPC of short 
codeword length. We also incorporated an outer RS code 
of low correction power to clean out the residual errors of 
T-EPCC-RS or T-EPCC-LDPC TPPCs. In conclusion, this 
work established T-EPCC-gLDPC as a reasonable complexity 
approach to introducing non-binary LDPC to the perpendicular 
recording read channel architecture, paving the way to reliable 
higher recording densities. 
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